On Watts' Cascade Model with Random Link Weights 



T. R. Hurd 1 , James P. Gleeson 2 
Department of Mathematics, McMaster University, Canada 
2 MACSI, Department of Mathematics & Statistics, 
University of Limerick, Ireland 

November 27, 2012 



Abstract 

We study an extension of D. Watts 2002 model of information cascades in social 
networks where edge weights are taken to be random, an innovation motivated by re- 
cent applications of cascade analysis to systemic risk in financial networks. The main 
result is a probabilistic analysis that characterizes the cascade in an infinite network 
as the fixed point of a vector-valued mapping, explicit in terms of convolution inte- 
grals that can be efficiently evaluated numerically using the fast Fourier transform 
algorithm. A second result gives an approximate probabilistic analysis of cascades on 
"real world networks" , finite networks based on a fixed deterministic graph. Exten- 
sive cross testing with Monte Carlo estimates shows that this approximate analysis 
performs surprisingly well, and provides a flexible "microscope" that can be used 
to investigate properties of information cascades in real world networks over a wide 
range of model parameters. 
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1 Introduction 



The main concern of the present paper is to study a specific generalization of the well- 
known random cascade model introduced by Duncan Watts |2002| that represents the 
spread of an "infectious idea", such as a fad, a rumour, a song, a political opinion, 
or a new technology, on a random social network. In Watts' model, individuals and 
their friendship relations are modelled as nodes and undirected links of a "random 
network" , i.e. a generalization of the random graph model of Erdos and Renyi [1959 . 
Each node is assumed to have a deterministic or random threshold <3?, such that a 
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node will "adopt" the new idea as soon the number of its friends already adopting this 
idea exceeds the threshold <£. Watts showed under some strong assumptions that the 
new idea will "percolate" or "cascade" through the infinite network. More precisely, 
this means that the set of adopting nodes triggered by any infinitesimal fraction of 
seed nodes will grow to a positive fraction of the network with positive probability if 
and only if a certain analytical condition on the network parameters holds. 

In the present paper, we no longer assume that the strengths of social interactions 
between friends (i.e. linked nodes) are equal and deterministic, but instead take 
edge weights to be random variables whose distributions may be dependent on the 
connectedness of the nodes. This means that if I denotes a social link between two 
friends who have friendship degrees k, k' (that is, these friends each have in total 
k and k' friends respectively), we take the strength of their friendship to be Wf, a 
random variable drawn from a distribution Gtk' that depends on k,k'. This new type 
of randomness reflects a genuine uncertainty we have in our observations of real social 
networks. 

The problem of random edge weights appears to be little studied in the vast 
literature that has built up around the Watts Cascade Model. However, the theory 
has moved substantially in other directions and promising new areas of application 



have been found in Watts and Dodds 2007 , Dodds and Payne 2009 , andDodds et al. 



2012 . These developments are reviewed in Newman 2010 . The Watts percolation 



theory itself has been extended to far more general random graph ensembles, for 



example in Boguha and Serrano [2005 



Our motivation to study the Watts model in this way comes from a relatively new 
thread of economic research into "systemic risk" in financial networks. This term 
refers to financial system crises that involve domino-type cascades of defaulting or 
insolvent banks, triggered by an initial shock. Eisenberg and Noe| |2001| provided a 
highly stylized network picture of a system of banks with balance sheets that distin- 
guish external and interbank lending and borrowing. In this network, each directed 
edge between banks has a weight that denotes the size of an interbank loan. An initial 
shock to the balance sheets of the system may cause the insolvency of some banks. 
Insolvent bank assets are liquidated at less than their book value, causing balance 
sheet shocks to their creditor banks. Such shocks may lead to secondary insolven- 
cies: understanding the resultant "insolvency cascade" is the focus of systemic risk 



research. Nier et al. 2007], in an influential paper, provided a random specification 



of this type of system, and through Monte Carlo simulations, discovered a number of 
non-intuitive, non-monotonic relationships between model parameters and the resul- 
tant cascade size probability distribution. Gai and Kapadia [2010| then introduced 
an elegant systemic risk model based on a directed graph version of Watts' setup, 
and by adapting Watts percolation results, were able to determine cascade sizes by 



analytical means. This paper was extended in several directions in Amini et al. 2012 



and Hurd and Gleeson 2011 , who proved more general versions of the percolation 
theorems. 

This new analytical theory of systemic risk hits a major roadblock whenever a 
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realistic specification of interbank lending is made: until now, such models have not 
able to handle edge weights that are random or depend deterministically on the size 
of banks. It is this roadblock in understanding systemic risk that we address here. 
To keep everything as simple as possible, we turn away from financial systems mod- 
elled by directed networks, and focus instead on information cascades in undirected 
networks that is the standard interpretation of Watts model. 

Our version of the Watts undirected cascade model allows for a lot of modelling 
flexibility, since the underlying network (the "skeleton graph"), the adoption thresh- 
olds, the edge weights and the initial set of "seed nodes" are all allowed to be stochas- 
tic. One might take the view that the theory is now far from parsimonious. Making 
use of this additional flexibility in practice comes at a cost: statistical estimation 
or "calibration" of the myriad parameters to any specific real world network may 
turn out to be prohibitively difficult. It is this concern that lead us to the second 
main contribution of our paper: we are able to specialize the framework to any deter- 
ministic finite skeleton graph, thereby reducing complexity, whenever one accurately 



observes the nodes and links (as one does for example in Facebook networks, Traud 



et al. [2011] ). The resultant picture is then of a known "real world network", but 



with randomness in its node thresholds, edge weights, and initial shocks that directly 
reflects our remaining uncertainties. As we will see in this paper, the Watts type 
cascade analysis extends naturally to this more realistic setting. 

Modelling real world networks directly in this way contrasts with prior work on real 



world networks that makes use of the infinite network analysis, Melnik et al. [201 1 



Typically, these papers start with network data thought of as an empirical probability 
distribution and as a first step, match this empirical distribution to an estimated 
distribution, taken from a parametric family of infinite random graph models. The 
cascade analysis is then applied to this estimated infinite graph model. Finally, the 
resultant cascade probabilities are taken as indicators of the cascade risk inherent in 
the original real world network. The conclusions of such an indirect approach will 
likely be weak, since the structure of a given real observed network is often very unlike 
any of the standard families of infinite random graph models. 

By comparison, our probabilistic cascade analysis works directly on the empiri- 
cal finite network. Whenever the analytic hypotheses are verified, or shown to have 
insignificant impact, we obtain measures of cascade risk that directly reflect the ac- 
tual topology and structure of the real world network. We can then hope that our 
predicted cascade measures will accurately reflect reality. In this paper we will see 
whether this hope is justified by comparing our analytical formulas to Monte Carlo 
estimates under a range of modelling specifications. 

During this work certain subtleties concerning the dependence amongst random 
variables lead us to reevaluate the logical and probabilistic arguments for cascading 
that are typically invoked in random graph theory. In the literature, for example 



Newman 2010 , such arguments have rested heavily on the use of generating functions 



and underplayed the role of conditional probability. The logical framework is certainly 
elegant and efficient. However, we have found it preferable to use a more direct and 
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transparent probabilistic style of argument initiated in Hurd and Gleeson [2011 . It 
would be a demanding exercise to translate our probabilistic analysis into the more 
familiar generating function language! 

Our analytical results only hold under an important and restrictive assumption 



that generalizes a property of the random configuration graph model of Bollobas 



1980 that was the setting of Watts' original paper. Called the locally tree-like inde- 



pendence assumption (LTIA) in Hurd and Gleeson |2011j, this is the formal equivalent 
to the so-called "mean-field" assumptions often made in the literature. It holds in 
the infinite graph limit in the class of configuration models, but fails to hold in many 
other important classes of random graph models. The second technical innovation 
of our paper, the WOR condition, makes sense only under the LTIA assumption. It 
provides the key to finding the somewhat subtle unravelling of dependent random 
variables needed to prove Theorem [TJ 

The flexibility of our analytical framework opens up a lot of questions for study. 
Space permits us to address only a few: Do we observe qualitatively new network 
effects such as phase transitions, due to the randomness of edge weights? Do results 
from the infinite graph setting accurately reflect what is observed in finite networks? 
When does the LTIA assumption lead to accurate results for real world networks? 
We hope readers will find the methods developed here compelling enough to program 
for themselves, and invite them to use such methods as a computer "microscope" to 
investigate further detailed properties of random networks. 



2 The Extended Watts Cascade Model 



The famous paper Watts 2002 introduces a model of a social network (network of 
"friendships"): the problem is to study if and how a newly introduced technology 
"percolates" through the network. Such information cascades can be viewed as a 
model for the spread of ideas, rumours, a social fad, or any number of analogous phe- 



nomena. This paper has important antecedents in the literature, notably Granovet- 



ter 



|1978| and Schelling 1973] (see also Easley and Kleinberg 2010] and references 
therein). The setup of the original model starts with a "skeleton graph" on N nodes 
labelled by N = {1, 2, . . . , N} (where N may be infinite), that is, an undirected ran- 
dom graph £ C M X J\f. Each undirected link of the graph represents the existence 
of a friendship connection between two individuals or nodes. For the next layer of 
model structure, each node v E M is assigned a random threshold Q v £ K. At the top 
level, a random subset Aq £ M of "early adopter" nodes is selected that represent 
those individuals that adopt the new technology before being influenced by friends. 
For each realization of the above set of random variables, the cascade describing the 
adoption of technology by nodes proceeds through a sequence of deterministic steps. 
At each step, a node v "adopts" if at least <J>„ of its neighbours were in "adopted" 
state in the previous step. The primary aim of the model is to characterize the prob- 
ability of nodes eventually adopting the technology, and in particular to determine 
the "phases" in which the new technology will be taken up by a large fraction of the 
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network. 

There are many possible generalizations of the Watts model. In this paper we 
focus on the particular generalization where the edges I are given random weights 
W£ that describe the degree of friendship (or influence) between friends. The random 
nature of these weights reflects the uncertainty we have in knowing the strength of 
friendships in a network. One is now interested in questions such as: 

• Is there a major effect on the cascade if we has fat tails (such as a Pareto 
distribution)? 

• Does the percolation picture of cascades survive in this generalization? 

• Are the cascade results computable in closed form, or must they be replaced by 
numerical techniques? 

• Do random edge weights lead to interesting new phase structures as model 
parameters are varied? 

The full specification of the extended model consists of the following ingredients. 

1. The skeleton graph £ is a random Bollobas configuration graph with a specified 
degree distribution P\~ = P[deg(t>) = k], k G JC. For simplicity, we assume that 
K. is a finite set of non-negative integers. We also let z = Y^k ^Fk denote the 
mean degree. 

2. The random variables & v G M + have cumulative distribution functions (CDFs) 
Fk{x) = P[3>t; < x\k v = k] that depend only on the degree k v . We suppose that 
the set of initially adopting nodes Ao is defined by the condition = 0, which 
occurs with probability = -Ffc(O) if k v = k. 

3. The random weights w vv > G M + for edges (v,v') G £ have CDFs Gkk'{x) which 
depend on the deg 

4. The condition for a node v to have "adopted" at the nth step of the cascade, 
written v G An, is 

W v'v > $v (1) 

v'eAfvClAn-1 

for any n > 0. 

Finally, we assume a strong form of conditional independence: conditioned on the 
skeleton graph £, the family of random variables {& v , w vv /} for all v, v' form a mutu- 
ally independent collection. 

Note that the original Watts model arises by taking £ to be a configuration random 
graph with general degree distribution, we = 1 deterministically and Fk(x) = + 

(1 — A^)F(x/k) for a single function F(x). 

Our aim is now to track the evolution of the cascade through the increasing 
sequence of sets of nodes: 

Ao->Ai->---->An->--- 
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and to determine the probabilities that a node of given degree eventually adopts: 



k 



n—>oo 



lim F[v G An\k v = k] . 



The mathematical analysis that follows is based on a single assumption we will 
make throughout this paper. 

The "locally tree-like" independence assumption (LTIA): Consider an ex- 
tended Watts model on A/" defined by a collection of random variables {£ , {$}, {w}). 
Let A/i,A/2 C Af be any two subsets that share exactly one node Ai D A2 = {v} and 
let X%, X2 be any pair of random variables where for each i = 1, 2, X$ is determined 
by the information on A/i. Then, conditioned on information located at the node v , 
that is conditioned on or fey, Xi and X2 are independent [j] 

LTIA is a concise formalization of conditions that appear extensively in the liter- 
ature under the name of "mean field" approximations. In our setting, two distinct 
model properties are captured by LTIA: 

1 . The random graph E should be tree- like in the sense that the density of triangles 
and higher order cycles is zero. This is true if E is an actual finite tree (if N is 
finite), or if it is drawn from an infinite Bollobas configuration model. 

2. The collection of random variables $ v ,w vv i form a mutually independent col- 
lection, conditioned on the random skeleton graph E. 

When the LTIA condition is exactly true, the analytical results we present here 
are exact. In practice, in a model where LTIA is not true, our analytical results are 
an approximation to the true behaviour in the model. 

Certain mathematical considerations arise in the following analysis. First, one 
needs to know that the degree distribution of v conditioned on v being the end-node 
of an edge is not P^ but 



where z = kP^ is the mean degree. 

The second consideration is to have a compact representation for the probability 
that X > Y for two independent non-negative random variables. Let (•, •) denote the 
standard Hermitian inner product on functions on (—00,00): 



^Thc following is a concise mathematical expression of this independence assumption. To any subset of 
nodes Af' C Af we associate the sigma-algebra Q' generated by the thresholds <I> U and degrees k v of nodes 
v € Af' and edges (v, v') in Af' x Af' . If we let Q\-,Qi-,Q v denote the corresponding sigma-algebras, then Gi 
and Q2 are independent sigma-algebras conditioned on Q v : that is, (Gi -LL G2) \Gv 



Qk = F[k v = k\v 6 N v i for some v'] 



kP k 



,kelC 



z 
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Assuming that X has a density fx{x) and the CDF Fy of Y is continuous, and letting 
these functions have Fourier transforms F(fx)iF(Fy), then by the Parseval Identity 

nx > Y] = f x (x)F Y (x)dx = (Fy, fx) = ^(HFv),Hfx)). (2) 

The last of these formulas will be useful in computing such probabilities when X is 
itself a sum of independent random variables X = X\ + X2 because whereas fx (x) 
is a convolution 



rx 

fx(x)= fx 1 (v)fx 2 (x - y)dy 
Jo 
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its Fourier transform is a simple product, T(fx) = J r (fx 1 )J r (fx 2 )- 

The main result of this section is a theorem that determines the double sequence 
of probabilities 

4^ =P[«6 A n \k v = k] 

beginning with the initial vector A? ,k E /C. The theorem is inductive on the steps 
n = 0,1,2, ... of the cascade. To motivate the result, we consider the defining 
property ([!]) of the set A n and note this implies 

4 n) =P[$„< Yl w v >v\k v = k]. (3) 
u'£M,n.4»-i 

The subtle point is that the random variables & v , A n -\ and w vv i occurring in this 
expression have a complicated dependence, and the probability cannot be evaluated 
directly using ([2]). The key to unravelling this dependence is a notion that takes care 
of the causal relations in the order of adopting nodes. We say that a node property 
V holds for a node v "without regarding" a neighbouring node v' £ M v , and write 
'"P holds for v WOR v hl , if this property of v holds in the subgraph of £ obtained by 
deleting the edge to v' and the entire connected subgraph rooted on this edge. It is 
important to note that the concept '"P holds for v WOR v'" makes sense only under 
the LTIA assumption: otherwise with positive probability the deleted subgraph will 
include v itself via a cycle in the network. 

We will show that ([3]) is expressible in terms of intermediate WOR probabilities 
defined by 

= F[v e A n WOR v'\k v = k, v' € M v ] (4) 
where v, v 1 are random nodes selected from J\f. At the outset, 

Ci 0) = F[$ v =0\k v = k, v' e M v ] = Af . (5) 
We begin by noting that for n > 0, v £ A n is true if and only if 

*« < E ^ 

v"£jV v 

where 

= W vv nl{ V " G An-1 WOR V) . 
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Next we observe that under the condition k v = k, LTIA implies that the k random 
variables wfy ^ for v" G N v are independent. They are also identically distributed 
with CDF given by 

P[4"~ x) <x\k v = k, v" g M v ] = p [*fc 1} ^ x \ k » = k > k "" = k "' v " G ^]Qk» 



k" 



Qk" ■ 



In the above step, we have used LTIA again: conditioned on k v = k, k v ii = k" , the 
random variables w vv " and l(v" G A n -i WOR v) are independent. Hence it follows 
that 

A k n) = P[ Wvv»l(v" G An-i WOR v) ><f> v \k v = k 

v"eAf v 

= (6) 

where ^{g^ ^) is the Fourier transform of 

3i n_1) M = E [0- - cjT X) Mx) + C^- l) g kkl {x)] Qy (7) 
k' 

and Jo denotes the Dirac delta function centred at 0. 

We now suppose the vector {C^™ 1 , k G /C} is known for some n > 0, and seek a 
formula for the vector C k . The logic is the same as in the previous paragraph, with 
one difference. Given k v = k, v' G JV V , the sum over v" is over the set N v \ v' instead 
of the set M v . It follows that 

ct ] = ^HFk)AHg { r 1] )) k - 1 ) (8) 

with g k n ^ given again by Q. 

The above arguments prove our first main result: 

Theorem 1. Consider an extended Watts model on N defined by a collection of 
random variables (£, {<&}, {w}) that satisfies LTIA. Then: 

1. The probabilities C { k n) = F[v G A n WOR v'\k v = k, v' G N v ] satisfy the recursive 
formula 

ct ] = ^m^Hot 1 ^- 1 ) (9) 

with Fk(x) = P[3>„ < x\k v = k], g k (x) given by Q. The initial values are 
=Af) =F[$ v = 0\k v = k}. 

2. The probabilities A k = F[v G A n \k v = k] are given in terms of ' C k l ^ by 

4 n) = ^HFk)AHg { r 1] )) k )- (io) 
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3. The probability that a node of degree k eventually adopts, A, , is given by 

(10) in terms of a vector C*- 00 -* = {c[°° }keK that satisfies the M. K valued fixed 
point equation defined by the recursion given in part (1). The existence (but 
not uniqueness) of such a fixed point is guaranteed by the Tarski-Knaster Fixed 
Point Theorem. 

Remark 2. It is interesting that nearest-neighbour correlations in this model are 
determined in terms of the C and A probabilities. In fact, under LTIA, the joint 
probability of eventual non-adoption of a nearest neighbour node pair v, v' with k v = 
k,k v ' = k' is simply: 

F[v, v' <£ Aoo\K = k, k v , = k'] = F[v $ WOR v', v' <£ A x WOR v\k v = k, k v , = k'] 

= [i - C^}[1 - C^] . 



A result of this type is found in Gleeson and Melnik t 2009K 



It is well-known in network science, see e.g. Newman 2010| [Chapter 13.11], that 
the frequency of global cascades in infinite random graphs is related to the fractional 
size of the so-called in-component associated to the giant vulnerable cluster. In the 
present context, a vulnerable cluster has the meaning of a connected subgraph of the 
network consisting of "vulnerable" directed edges, where a vulnerable directed edge 
is an edge whose weight is sufficient to exceed the adoption threshold of its forward 
node. We define: 

• £y C £, the set of vulnerable directed edges; 

• £ s , the giant strongly connected set of vulnerable edges (the "giant vulnerable 
cluster" ) ; 

• £i, the "in-component" of the giant vulnerable cluster, i.e. the set of vulnerable 
edges that are connected to £ s by a directed path of vulnerable edges; 

• 1 — by := F[(v',v) G £%\k v i = k'], a conditional probability of an edge being in 
p.- 

• C'k'k = F[& v < w v i v \k v i = k! , k v = k], the conditional probability of a node being 
vulnerable. 

Now note that (v',v) £ £f (i.e. the complement of £A means either Q v > w v > v or 
<3?t, < w v ' v and all the k v — 1 "downstream" directed edges (v,v") are in the set ££. 
Thus, invoking the LTIA, one determines that 

b k , = 5^P[fc„ = k\k v > = k',v G Af^]F[(v',v) G £i\k v > = k',k v = k] 



k 

^kl\ 

Z 



k 



(1 - ay k ) + a k t k b\ 1 := h k >(b) 



In other words, the vector b = {b k } ke £ satisfies the fixed point equation b = h(b) 
where 

kP k r 

h k'0 3 ) = 2_ J (! - a k'k) + afc'fc^ -1 



k" G JC . (11) 
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The cascade condition is the condition on a random network that determines 
whether or not it has a giant vulnerable cluster, or equivalently, whether a randomly 
selected initial adopting node has a positive probability of triggering a cascade that 
reaches a positive fraction of the infinite network. As shown in a similar setting in 
Dodds et al. |2012 , Amini et al. 2012 and Hurd and Gleeson [201 1 , this boils down 
to a condition on the spectral radius ||D|| of the derivative matrix D = (-D/c'/Ofc'fceA;- 
Here D^k = dhk'/dbk is evaluated at the trivial fixed point b = [1, 1, ... , 1], and is 
given by an explicit formula: 



Dkk> 



a k i k k{k - l)P fc 



Recall that the spectral radius of a square matrix is the magnitude of its largest 
eigenvalue. 

Cascade Condition: If ||JD|| > 1, then with positive probability, a single random 
seed will generate a cascade that reaches a positive fraction of the infinite network. 
If ||.D|| < 1, then with probability 1, the cascade generated by a single random seed 
will be a zero fraction of the infinite network. 

For a global cascade to arise from a random single seed, it is sufficient for the seed 
v to trigger at least one edge (v , v 1 ) 6 Therefore, by conditioning on the degree of 
v one finds that the frequency of global cascades is bounded below: 



/>^[l-6|]P fc . 



fc>0 



3 Discrete Probability Distributions 

The structure of ^ and similar equations is problematic from the point of view of 
numerical approximations. Numerical evaluation of the implied integrals leads to 
truncation errors and discretization errors, both of which will be difficult to handle in 
our setting. In this section, we analyze the case where the random variables {^ v , wg} 
all take values in a specific finite discrete set Ai = {0, 1, . . . , (M — 1)} with a large 
value M. In such a situation, the convolutions in ^ can be performed exactly and 
efficiently by use of the discrete Fast Fourier Transform (FFT). 

Let X, Y be two independent random variables with probability mass functions 
(PMF) px,PY taking values on the non-negative integers {0,1,2,...}. Then the 
random variable X + Y also takes values on this set and has the probability mass 
function (PMF) px+Y = Px *Py where the convolution of two functions /, g is defined 
to be 

n 

(f*g){n) = ^2f(m)g(n-rn). (13) 

m=0 

Note that px+Y will not necessarily have support on the finite set A4 if px , Py have 
support on M.. This discrepancy leads to the difficulty called "aliasing" in the next 
section. 
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3.1 The Discrete Fourier Transform 

In this section we consider space C M of C- valued functions on .M = {0, 1, . . . , M — 1}. 
The discrete Fourier transform, or fast Fourier transform (FFT), is the linear mapping 
F :a= [a , . . . , OAf-i] eC M 4a = F{a) G C M defined by 

Ofc = ^2 (kiai ,k G M . 
leM 

where the coefficient matrix Z = (Cm) has entries = e ~ 2ntkl / M . 

It is easy to prove that its inverse, the "inverse FFT" (IFFT), is given by the map 
a — >■ a = Q(a) where 

«fe = ^^ (kiai ,k G M . 
leM 

If we let a denote the complex conjugate of a, we can define the Hermitian inner 
product between 

(a, b) := ^ °<rnbrn ■ 
meM 

We also define the convolution product of two vectors: 

(a * b)(n) = K n ~ 771 modulo M), n G M. . 

meM 

Now we note the following easy-to-prove identities which hold for all a, b G C M : 
1. Inverse mappings: 



a = G{T(a))=F(G{a)) ; 



2. Conjugation: 

3. Parseval Identity: 



I 



(a,b)=M{a,b) = ^(a,b) 



4. Convolution Identities: 

a -*b = (a * b) ; a -*b = (a * b) 

where ■* denotes the component-wise product. 

We note that if the support of the sum of two .M-valued random variables X, Y is 
itself in M, that is, if supp(X + Y) = {n\3m G supp(X)s.t. n — m G supp(y)} G M, 
then px+Yi the PMF of X + Y, is given by px * Py- On the other hand, if there 
are m G supp(X), n G supp(Y) such that m + n > M, then px+Y — Px * Py is not 
zero: such a difference is called an "aliasing error" . In what follows we assume that 
all aliasing errors can be made zero by choosing M sufficiently large but finite. 
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We now consider the probability 



P = P[Y<^2X Z ] . 



i=\ 

for independent random variables Y, X\ , X2 , • • • , Xf. where Xi are distributed with 
PMF gi{m) and Y has PMF f(m). We suppose that / has support on M. while 
each gi has support on {0,1,..., [M — l/k\}. We also define the CDF F(n) = 
El=o/( m ) = < n) for n £ M. Then using the above identities, we are lead 
to an efficient means to compute P for large k involving matrix operations and the 
FFT: 

k m 

= P[Y < J2 X i\ = (01 * ' ' ' * 9k){m) f( n ) = £ (91 * ■ ■ ■ * g k ){m)F{m) 
= {F,g 1 *---*g h ) = —(F,(gi*---*g k )) = —(F)'*(gi-*...-*g k ). (14) 

Here in the final expression, A' denotes the conjugate transpose of a matrix A, * 
denotes matrix multiplication between a size [1, M] matrix and a size [M, 1] matrix, 
and •* denotes component-wise multiplication of vectors and matrices. 



Remark 3. There is no aliasing problem and the identity (14) is true if gi * ■ ■ ■ * g k 



has support on Ai. We formalise this requirement as the Aliasing Assumption. 

Remark 4. The overall computation time for any numerical implementation of the 
cascade formulas in Theorem^ will be dominated by computing convolutions such as 



those in ( 10 ) . In particular, one can check that negligible total time will be taken in 
computing FFTs ( each of which take of the order of M In M additions and multipli- 
cations). One can compare the efficiency of our recommended FFT approach to the 
direct approach by considering a single evaluation of the convolution: to compute f*g 
using ( 13 ) for M -vectors /, g requires M 2 additions and multiplications, whereas to 



compute f ■ *g requires only M multiplications. All else being equal, we can expect 
a speedup by a factor of O(M) when the FFT method is used instead of the direct 
method. Since in our implementations we often have M > 2 10 ; this is a huge im- 
provement. Of course, this factor is too optimistic: it will be worsened considerably 
when one takes care of the aliasing problem. 



4 Fixed Random Graphs 

We now consider how the formalism of Section 2 can be applied in an approximate 
fashion in the important special class of problems when the skeleton graph is actually 
known (i.e. deterministic) and finite while the thresholds and edge weights are ran- 
dom. This will allow us to address the question of systemic risk in tractable models 
of real observed financial networks. 
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Let A = (A vv '),v,v' G J\f be the symmetric adjacency matrix of the fixed undi- 
rected graph £. We number the nodes in M by v = 1,2, ... ,N and the links in 8 by 
t = 1, 2, . . . , L where L = Yli<v<v'<N A™'- The discrete threshold and edge weight 
random variables $„, W£ are assumed to have support on the set M. = {0, 1, . . . , M — 
1}. The probability that node v is an early adopter is A^ ] = F[<S> V = 0] = F[v G A \. 
The CDF of <$> v is an M- vector F v (m) = P[$ v < m] for m e M, and we let 
F v = J-{F V ) denote its discrete Fourier transform (FFT). Similarly, g^{m),m G Ai 
denotes the FFT of the M- vector gi{m) = P[we = m], the probability mass function 
of W£. To ensure LTIA, it is necessary but not sufficient that {& v ,Wi} is a collection 
of mutually independent random variables. The LTIA is exact only when £ is a tree 
graph or if the edge weights and thresholds are deterministic. 

The aim of this section is to use LTIA to derive an approximate formula for the 
marginal likelihoods A^°^ := P[v eventually adopts]. In the general case, we expect 
that the approximation that takes the dependence between different branches of the 
random graph to be negligible should improve as gets large, all else being equal. 
We will find that the dependence of the accuracy on other parameters is not easy to 
understand, but that overall assuming LTIA leads to surprisingly good performance. 

The analysis will determine the required probabilities A^ = P[v G A n ] via the 
intermediate probabilities 

= P[u G A n WOR v'] 

that will be computed for each of the 2L directed links I = (v, v') and n = 1,2, . . . . 
At the outset, C^, = A^ . 

The analysis for a finite real graph exactly parallels the argument presented in 
Section 2. First we note that v G A n for n > means that 

y~] Wv",v > §v 

where 

w v " iV := w vv 'il(v" G A n -i WOR v) . 

By LTIA, we assume that w vv » and l(v" G A n -\ WOR v) are independent for all 
v" G N v . Therefore, the sum of the w over v" is a sum of independent random 
variables, each with moment generating function (MGF) [^] 

g^H = (1 - C$f) + C$fg v „, v (m),me M . (15) 

From these facts, and ( |14[ ), and making use of the Aliasing Assumption (Remark [3]), 
we deduce the formula for Av '■ 

At) =p[ y, ™*>,v > *«] = (f v , n air v l) ) . (i6) 

v"eU v v"<EAf v 
2 Note: we have used the fact that the MGF of the random variable is the M- vector [1, 1, . . . , 1]. 
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When it comes to computing in terms of the set {C^ n_1 ^}, we may repeat 
this argument. The only change is in the sum over v": it now runs over N v \ {v'} not 
N v . It follows that 

C$ = (£> II (17) 

v"&Nv\W} 

The above argument rests on two assumptions, and leads directly to the main 
result concerning real world graphs: 

Theorem 5. Consider an extended Watts model onM on a fixed finite graph £ with N 
nodes and L edges. Let the thresholds and edge weights be a collection of independent 
random variables {<fr v ,wi} supported on A4 = {0, 1, . . . , M — 1}. Assuming LTIA and 
the Aliasing Assumption (Remark^!, the following formulas hold: 

1. For each n > and directed edge v, v' , the probabilities C^™, = P[v G A n WOR v'] 
satisfy the recursive formula 

r»( n ) — I P TT ^.(n— i) ^ 

v"dMv\{v'} 

with F v (m) = P[& v < m] and g^, J\x) given by (15). The initial values are 



C1 0) V ,=A { ° = F[$ V = 0}. 
2. The probabilities A { v n) = F[v £ A n ] are given in terms of the collection ci ^ 



by (16). 



3. The probability that a node v eventually adopts, A^\ is given by (16) in terms 
of the 2L-vector C^ 00 ** = {C^°}} that satisfies the M. 2L -valued fixed point equa- 
tion defined by the recursion given in part (1). The existence (but not unique- 
ness) of such a fixed point is guaranteed by the Tarski-Knaster Fixed Point 
Theorem. 

Remark 6. Caveats: In practice, we can arrange for the Aliasing Assumption to 
be approximately true by choosing M large enough that the probabilities 
^[J2v"eAf '&v",v ^ M] are kept sufficiently small: aliasing errors of this order of 
smallness can thus be expected to appear in any implementation of our formulas. 
However, we cannot generally arrange for LTIA to be even approximately true. The 
outstanding difficulty in understanding the true nature of the adoption cascade will 
therefore be in understanding the biases and errors that relate the LTIA formulas to 
the "true values" that can only be estimated by slow Monte Carlo simulations of the 
real world network. 

The percolation argument of Section 3 has a straightforward analogue for real 
world graphs that satisfy the LTIA approximately. As before we define: 

• vulnerable directed edges t = (v',v) by the condition < W v » v , £y to be the 
set of such edges, and ag = P[£ £ £y]; 
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• £ s , a strongly connected cluster of vulnerable edges; 

• Si, the "in-component" of £ s ; 

• 1 - b e = F[£ e Si] . 

Treating LTIA now as an approximation, one can then show that 

b v 'v ~ a v'v J [ Kv" + (1 _ a v'v) ■ 
v"EAf v \v' 

That is, the sequence b = {b v > v } is an approximate solution of a fixed point equation 
b = h(b). Direct differentiation at the trivial fixed point b = [1, 1, . . . , 1] yields the 
result 

a e if£'e£i\£ 
else 



Dc 



When the spectral radius of D is greater than 1 we can expect to see large cascades 
from the initial random seed with frequency approximately 



U&/V 



1 ~ n bvv> 



5 Numerical Investigations 

In this section, we report results of some numerical experiments we have made testing 
the above analysis. We divided our experiments into those that addressed the infinite 
random graphs of Section 2, and those that addressed real world graphs as described 
in Section 4. In our numerical implementations, all random variables are taken to 
be either deterministic or discretized lognormals. The lattice spacing 5x and lattice 
size M were always chosen carefully to reduce any bias between the Monte Carlo and 
analytic implementations. 



5.1 Experiment 1: Infinite Graphs 

For simplicity, we considered a Poisson random graph model as used in the experi- 



ments of Watts |2002 , with Poisson degree distributions parametrized by z G [0, 10] 



and threshold CDFs 



F k {x) = 4 0) + (1 - A? l )l(x > fakv) 



(oh 



with 4>* = 0.18 and a[ = 10 -4 . In Experiment 1A, the only change was to allow 
the weight of an edge to depend deterministically on the degrees of its nodes, i.e. 
its "type", while keeping the average edge weight to be 1. We treated the original 
specification W£ = 1 as the benchmark case, and two additional cases where the 
weight w = w(k,k',a) of an edge of type kk' is proportional to (kk')~ a for some 
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a. The first alternative with a = —0.5 corresponds to a social network in which the 
strength of edges increases as the connectivities k, k' increase. The second alternative 
with a = 0.5 corresponds to a social network in which the edge strength of nodes 
decreases as the connectivities k, k' increases. The case a = recovers the original 
benchmark case. 

Figure [T] shows the results. First we note that for the default case a = 0, the curve 



replicates the results shown in Figure 2b of Watts 1 2002 . The case with a < does 



not differ dramatically from the constant weight case. Both show a lower cascade 
threshold very near to z = 1. 

In contrast, in the case with a > 0, where highly connected nodes have relatively 
less influence on their friends, we see clearly that the cascade window is shifted to 
higher values of z. The upward shift in the lower cascade threshold away from z = 1 
is somewhat surprising, and begs an intuitive explanation. The expected transition 
near z = 1 for a < coincides almost exactly with the percolation point at z = 1 for 
Erdos-Renyi (ER) random graphs because, as one can check, for these values of a and 
z ~ 1 the vulnerable edge condition for Sy is satisfied with high probability for all 
nodes. By contrast, when a = 0.5 and z ~ 1, there is a significant positive probability 
for non-vulnerable edges of low degree, and consequently a significant upward shift 
in the transition point. 

In Experiment IB, we explored the effect of additional randomness, by taking 
the random variables to be lognormally distributed, with means w = w(k,k',a) 
matching Experiment 1A and standard deviation 0.5 x w, whenever the type of £ 
is (k,k'). Surprisingly, the effect of randomness is counter to our expectation that 
the critical cascade region would shift to lower z: as Figure [2] shows, the opposite 
behaviour occurs. 

In both Experiments 1A and IB we also compared the N = oo analytical results 
to large scale Monte Carlo estimates computed for N = 10 5 , Figures [l] and [2] show 
the expected excellent agreement. 

In Experiment 1C, we observed the dependence of the average cascade size 
on z and <j> when the edge weights were lognormal with mean and standard deviation 
(1, 0.5). We can see in Figure|3]that a jump discontinuity in A^°°^ arises as a function 
of (ft, as z increases above 3.5. In the next section we will observe that this discon- 
tinuity is strongly associated with discrepancies between the LTIA analytics and the 
true values for finite real world networks. 



5.2 Experiment 2: Finite Erdos-Renyi Graphs 

The aim of Experiment 2 was to investigate how the analytical formulas of Theo- 
rem [5j proved under the LTIA assumption, compare to the results of Monte Carlo 
simulations, on specific real world networks. In Experiment 2A we considered twelve 
Erdos-Renyi skeleton graphs, of three sizes = 10, 100, 1000 and four different val- 
ues of the connectivity parameter z = 1.5, 3.5, 5.5, 7.5. In each case, the default seed 
was a random set of N/10 nodes, and the weights were log- normally distributed with 
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mean and standard deviation (1, 0.5). The threshold distribution for a node of degree 
k was log- normal with mean and standard deviation {4>k, (j)k/2) for variable (f>. 

In each of the three subplots of Figure |4j the four curves show the analytical values 
of mean cascade size a functions of the threshold parameter <p, for the four different 
values of z. The three subplots are for the three different values of N. On each figure, 
the circles and squares show the corresponding Monte Carlo values computed with 
10000 realizations. 

The Monte Carlo values shown on these figures give a very clear demonstration 
of how the N — > oo limit as shown in Figure [3] is approached through finite Erdos- 
Renyi graphs. One sees little discrepancy when N = 1000, while for smaller N the 
discontinuities in <fi are smoothed out over a wider range of (f>. When one compares 
analytic results derived from Theorem [5] to the Monte Carlo values, one finds dis- 
crepancies over a wide range of <j> when N = 10. However, for N = 1000, the range 
over which a significant discrepancy occurs has narrowed considerably, and is con- 
fined to the range where the cascade size varies rapidly with (p. Further numerical 
experimentation shows that this type of behaviour is quite typical. 

Experiment 2B tested whether the LTIA analytic formula can accurately deter- 
mine adoption probabilities on a node-by-node basis. The four panels of Figure [5] 
show results for a single Erdos-Renyi skeleton graph with iV = 100 and z = 3.5 
and four different threshold parameter values eft = 0.3,0.4,0.5,0.6. The panels plot 
the node-by-node probabilities as computed by both analytic (red crosses) and 
Monte Carlo methods (blue pluses), in increasing order of their analytic values. Taken 
together, they back up our general conclusion that the LTIA analytics show reason- 
able accuracy on a node-by-node basis, and that accuracy improves as one moves 
away from medium cascade regions. The green circles represent the node degree as a 
fraction of the maximum node degree: one notices that there is very little indication 
that node degree is significantly correlated with adoption probability. 

Remark 7. We also ran consistency checks to verify that the Monte Carlo estimated 
cascades were unbiased estimators of the LTIA analytic cascades for a wide range of 
finite tree graphs. We created graphs similar to Figure [5| that in all cases showed the 
almost perfect agreement one expects for tree graphs. These results are not shown 
here. 

6 Concluding Remarks 

The main result of this paper, the Cascade Theorem [T] for models with random 
edge weights, is not a straightforward extension of original Watts Model. Unlike 
earlier versions of the theory, the inductive derivation of Theorem [T] requires a subtle 
decoupling of the dependence of edge and threshold random variables. The key to this 
decoupling is the introduction of both the "locally tree-like assumption" (LTIA) and 
the "without regarding" (WOR) condition. We believe that this particular argument 
is new in the literature. For simplicity we have confined the discussion to information 
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cascades: it is clear that the ideas extend to the more complex models of cascades in 
financial systems. 

We have shown both in infinite networks and in finite real world networks that 
the analytics of Theorems [T] and [5] give correct answers where appropriate, and are 
a meaningful approximation where the LTIA assumption fails to hold. We have 
used these results to look at the effects random edge weights have on information 
cascades, and have found evidence of interesting transitions and "tipping points" 
deserving of further study. These theorems give us the possibility to explore the 
totality of models of this general type with efficient computer algorithms. Although 



our primary motivations come from the models of Nier et al. |2007 and Gai and 



Kapadia 20101 for systemic risk in financial networks, it is possible that our results 



are applicable to a wide range of network phenomena. 
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Connectivity parameter z 



Figure 1: Experiment 1A: This figure shows how the size of global cascades on infinite 
networks is influenced by nonconstant but deterministic edge weights. The benchmark 
case with a = 0, shown by the red triangles, is the case of constant edge weights. The 
cascade size remains similar in the case with a = —0.5 (blue triangles). In both cases, 
the lower critical value of z is approximately 1. On the other hand, when a = 0.5 (black 
pluses), the lower cascade transition is shifted up to z ~ 1.5, and furthermore, the upper 
critical value is shifted beyond 10 (in fact it occurs around z = 14). 
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Connectivity parameter z 



Figure 2: Experiment IB: This figure shows how the size of global cascades on infinite 
networks is influenced by lognormally distributed edge weights, with means consistent with 
the cases of Figure [Tj Compared to the deterministic cases, both the upper and lower 
critical values have been shifted to higher z values. All three curves, if extended beyond 
z = 10 exhibit upper critical values beyond z = 14. 
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N=infty ER Graphs: mean cascade size vs <j) 
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Figure 3: Experiment 1C: This figure shows how the mean cascade size depends on both the 
connectivity parameter z and the threshold parameter 0, when the weights are lognormal 
with mean and standard deviation (1,0.5). 



22 




N=100 ER Graphs: mean cascade size vs <|> 
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N=1000 ER Graphs: mean cascade size vs 
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Figure 4: Experiment 2A: These three figures shows how the size of global cascades on 
finite Erdos-Renyi skeleton graphs is influenced by lognormally distributed edge weights, 
with means consistent with the cases of Figure [TJ The figures from top to bottom show the 
results for skeleton graphs of size 10, 100, and 1000 respectively; the four curves correspond 
to different values of the connectivity parameter z. In each case, both LTIA and Monte 
Carlo values are plotted as functions of the threshold parameter 0. 
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Figure 5: Experiment 2B: These four panels show results for a single Erdos-Renyi skeleton 
graph with iV = 100 and z = 3.5 and four different threshold parameter values = 
0.3, 0.4, 0.5, 0.6. The panels plot the node-by-node probabilities as computed by both 
analytic (red crosses) and Monte Carlo methods (blue pluses), in increasing order of their 
analytic values. The green circles represent the node degree as a fraction of the maximum 
node degree. 
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